Gibbs States and the Consistency of Local Density Matrices 
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Abstract 

Suppose we have an n-qubit system, and we are given 
a collection of local density matrices p\ , . . . , p m , where 
each pi describes some subset of the qubits. We say that 
pi , . . . , p m are "consistent" if there exists a global state a 
(on all n qubits) whose reduced density matrices match 
Pi, ■ ■ ■ , Pm- 

We prove the following result: if pi , ... , p m are consis- 
tent with some state a >~ 0, then they are also consistent 
with a state a' of the form a' — (1/Z) exp(M\+- ■ ■+M m ), 
where each Mi is a Hcrmitian matrix acting on the same 
qubits as pi, and Z is a normalizing factor. (This is known 
as a Gibbs state.) Actually, we show a more general re- 
sult, on the consistency of a set of expectation values 
(Ti), . . . , (T r ), where the observables Ti,...,T r need not 
commute. This result was previously proved by Jaynes 
(1957) in the context of the maximum-entropy principle; 
here we provide a somewhat different proof, using prop- 
erties of the partition function. 



1 Introduction 

Many-body systems have an intriguing property: under 
the right circumstances, local interactions can conspire 
to produce long-range or global effects. This behavior 
leads to phase transitions in statistical mechanics, and it 
also appears in combinatorial problems such as 3- SAT. 
If we consider quantum systems, the situation is more 
complicated, due to non-commuting measurements and 
the possibility of entanglement. This leads to new kinds 
of quantum phase transitions 1 , and new examples such 
as the Local Hamiltonian problem |2J. 

A basic question in all of these examples is: if we know 
local information about various parts of a system, what 
can we say about the system as a whole? This paper gives 
one answer to this question, for quantum systems. 



Suppose we have an n-qubit system, and we are given a 
collection of local density matrices pi, ... , p m , where each 
Pi describes a subset Cj C {1, . . . ,n} of the qubits. We 
say that pi , . . . , p m are "consistent" if there exists a global 
state a (on all n qubits) whose reduced density matrices 
match pi, . . . , p m ; in other words, for all i = 1, . . . , m, 

tr{i,...,„}-c i (o') = Pi- 
Clear ly, if pi, . . . , p m are consistent, then whenever two 
density matrices pi and pj describe overlapping subsets of 
qubits (Ci n Cj ^ 0), they must agree on the intersection 
QnCj; that is, tr C4 _ (CinC .^(^) = tr c ._ (CinC . ) (p i ). This 
gives a necessary condition for consistency. 

However, the above condition is not sufficient to guar- 
antee consistency. To see this, consider the following ex- 
ample: we have three qubits, and we are told that qubits 
1 and 2 are in the Bell state |$+) = (|00) + |ll))/\/2, 
and qubits 2 and 3 are also in the same state |$ + ). 
More formally, let p A = |$ + ) ($ + |, A = {1,2}, and let 
Pb = |$ + )($ + |, B = {2,3}. In this case, pa and p B 
both agree on qubit 2, since tvx{pA) = 1/2 = tr 3 (ps). 
But there is no state a on all three qubits such that 
tr3(cr) = pa and tri(cr) = pb] one way to see this is to ap- 
ply the strong subadditivity inequality, S(l, 2, 3) +5(2) < 
5(1, 2) + 5(2, 3). 

Thus the consistency of pi, . . . , p m would seem to be a 
more subtle question. We prove the following result: 

Theorem 1 If pi, ... , p m are consistent with some state 
a^O, then they are also consistent with a state a' of 
the form a' = (1/Z) exp(Mi + • • ■ + M m ), where each 
Mi is a Hermitian matrix acting on the qubits in Ci, and 
Z = tr(exp(Mi H VM m )). 

Here, a >- means that a is a positive definite matrix. 
The state a' is known as a Gibbs state. 

Essentially, this result says that a Gibbs state a' can 
simulate an arbitrary state a >~ 0, with respect to an 
observer who can only access subsets C\ , . . . , C m of the 
qubits. For example, consider a physical system with lo- 
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cal interactions, described by a Hamiltonian H. It is easy 
to see that the ground state of H can be approximated 
by 77 = (l/Z)exp(—j3H), for (3 large; and since H is a 
sum of local terms, 77 is a Gibbs state. Our result extends 
this simple observation to a much more general setting. 

Actually, we prove the following more general result: 
Consider a finite quantum system, and let Xi, . . . , T r be 
observables (Hermitian matrices) . Without loss of gener- 
ality, assume that the collection of matrices I,T\, . . . ,T r 
is linearly independent (over R). We say that a state 
p has expectation values t\, . . . ,t r if tr(T^p) = ti for all 
i = l,...,r. 

Theorem 2 // there exists some state p y which has 
expectation values ti,...,t r , then there exists a state p' 
which has the same expectation values t±,...,t r , and is 
of the form p' = (l/Z) exp(#i7i + •■■ + 9 r T r ), where 
9l, . . . ,6 r £l. 

This statement holds even when the observables 
Ti , . . . , T r do not commute. 

This result was previously proved by Jaynes, as part of 
the maximum entropy principle in statistical mechanics 
Jaynes showed that the Gibbs state p' is the state 
which maximizes the entropy S(p) = — tr(plogp) subject 
to the constraints (Tj) = tf, implicitly, he also showed 
that the Gibbs state p' is always feasible, in the sense 
that it can produce the same expectation values (Ti) as 
an arbitrary state p >- 0. 

However, Jaynes' motivation was somewhat different 
from ours. Jaynes was interested in statistical mechan- 
ics, which deals with large systems with many degrees of 
freedom and only a few constraints. Feasibility is not usu- 
ally a concern in such cases, while the maximum-entropy 
property is crucial in making plausible inferences about 
the "true" state of the system. 

In this paper, we focus on finite quantum systems, with 
many non-commuting constraints; we are interested in 
the relationship between local constraints and the global 
state of the system. For us, feasibility of the Gibbs state 
is important, since it is possible for the system to become 
overdetermined. Statistical inference is less important, 
because the systems we study are small enough that their 
state can be completely determined (at least in principle) . 
Rather than viewing this as an inference problem, we 
can speak directly about what states are allowed under a 
given set of constraints. 

Finally, we prove our result using a technique which 
is different from Jaynes. Jaynes used the Lagrange dual 
of the entropy-maximization problem, while we use some 
analytic properties of the partition function. Our analy- 
sis bears some resemblance to classical results on expo- 
nential families in statistics — although the technical 



details are quite different. Our proof also contains some 
geometric intuition which may be of interest. 

2 Proofs of our results 

First, we will review some useful facts about the partition 
function for a Gibbs state. Then wc will prove theorem 
12 and obtain theorem ^ as a special case. 

2.1 The partition function 

Recall the situation described in theorem |2J we have a 
finite quantum system, and observables T±, . . . , T r , such 
that I,Ti, . . . ,T r are linearly independent (over R). Wc 
are interested in states of the form 

p{6) = exp(0iTi + • • • + 9 r T r )/Z(9), 9eM. r , 

where Z(9) = tr(exp(0iTi + • • • + 9 r T r )). Z(9) is called 
the partition function, and we also define the log partition 
function ip(6) = \ogZ{8). 

Note that, in the above definition, we can translate 
each observable Ti by a multiple of the identity, without 
changing the state p(9). More precisely, if we define new 
observables Pi = T + Xil, with A; € R, we have that: 

exp(6>iPi + --- + 6i r P r ) _ exp(0iTi + • • • + 9 r T r ) 
tr(exp(0iPi + --- + r P r )) ~ tr(exp(0iri + --- + r T r )) - 

Using subscripts T and P to denote the two sets of ob- 
servables, we arrive at the same state, pp(9) — pt{6), 
although the partition functions are different, Zp(9) ^ 
Z T {9). 

The log partition function ip has some nice analytic 
properties: it is convex, and its derivatives encode the ex- 
pectation values of the observables Ti. We briefly sketch 
these results, which can be found in quantum statistical 
mechanics £1], as well as quantum information geometry 
0. 

Proposition 3 if> is convex on R r . 

Proof sketch: This follows from some facts in matrix anal- 
ysis 7 . First, the Golden-Thompson inequality: If A and 
B are Hermitian matrices, then 

tr(exp(^4 + B)) < tr(exp(A) exp(B)). 

Next, a matrix version of Holder's inequality: For any ma- 
trix A, define the Frobenius or Hilbcrt-Schmidt norm to 
be P|| 2 = (tr(AU)) 1 / 2 . Also, let \A\ denote the unique 
positive semidefmite square root of A* A. Then we have 
that, for all square matrices A and £?, 

\\AB\\ 2 <\\\An 1 2 /p \\\B\''\\l /(! , 
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for i + i = 1, p> 1. □ 

Proposition 4 ^ is differentiable on W , and 

f|=tr(T i/9 (0)) = (^>. 

Proof sketch: Use "parameter differentiation" 8 : If -ff 
is a Hermitian matrix which depends on a parameter A, 
and dH/dX and d 2 H/d\ 2 exist and are continuous, then 
d(exp(H))/dX exists and is equal to 

OH 
— — exp(if) = / exp((l — u)H) — - exp(uH)du. □ 
oX J dX 

2.2 Proof of theorem 2 

Proof: We are given expectation values t\, . . . ,t r , and we 
want to find a state 

p'(9) = exp(6» 1 T 1 + • • • + 9 r T r )/Z'(9) 

that has these expectation values. (Here, Z'(9) is the par- 
tition function, and ift'(@) = logZ'(9) is the log partition 
function.) By translating the observables Tj, we can as- 
sume that ti = 0, for all i = 1, . . . , r. We can now restate 
the problem in terms of the log partition function: we are 
looking for some 9 € R r such that VV>'(0) = 0. 

We know there exists a state p >- which 
has the desired expectation values ti,...,t r - Now 
choose some observables Ui,...,U s , such that the set 
{/, T\, . . . , T r , U\, . . . , U s } is complete and linearly inde- 
pendent (in other words, any 2™-dimensional Hermitian 
matrix can be written uniquely as a real linear combina- 
tion of the matrices in this set). Let u\, . . . , u s be the ex- 
pectation values of p for the observables U±, . . . , U s ; that 
is, Ui — tr (Up). By translating the U, we can assume 
that Ui — 0, for all i = 1, . . . , s. 

We will consider states of the form 

p(9, <j>) = exp(6»iTi + • • • + 9 r T r + 

4> l U 1 +--- + 4> s U s )/Z{0,<l>). 

(Here, Z(9, 4>) is the partition function, and ip(9, <fi) — 
log Z(9, </>) is the log partition function.) Completeness 
of the Ti and the Ui implies that we can write p in the 
form p = p(9,(f>) for some (9, 4>) € R r+S . This implies 
that Vip(6, 4>) = for some (9, </>) e R r+S . 

Furthermore, we claim that there is a unique point 
(9, 4>) such that p(9, (f>) has the expectation values ti and 
Ui. This is because the expectation values ti and Ui 
uniquely determine the state p, and setting p — p(9, (ft) 
uniquely determines the values of 9 and (ft. This in turn 
follows from the completeness and linear independence of 



the Ti and the U. So we conclude that Vip(9, (ft) = at 
exactly one point {9, (ft). 

To complete the proof, we will carry out the following 
plan: we will show that ij){9,(ft) — > oo as \\9, <j>\\ — * oo, 
where \\9,(ft\\ denotes the norm of the vector (9, (ft). This 
implies that *ft'{9) — > oo as \\9\\ — > oo; and hence Vip'(9) = 
for some 9 £M. r . (See figure^for a simple example that 
shows the geometric intuition for the proof.) 

Let (9o,(fto) be the unique point where vanishes. 
We claim that (9o,(fto) is the unique global minimum of 
tp. [Since tp is convex (proposition EJ, it follows that tft is 
bounded below, and (6*o,</>o) is a global minimum. Also, 
ift is differentiable everywhere on the domain R r+S , which 
has no boundaries (proposition^ ; so any extremum (9, (ft) 
must satisfy Vtp(9 7 (ft) = 0. But this happens only at 
(#o, (fto), and so (#o, 4>s) is the unique global minimum.] 

Let S be the set of all unit vectors in R r+S . Define 
the function f(v,z) = i/j((9 ,(fto) + zv), for v £ S, and 
z € K. Say we fix z = 1. We claim that there exists 
some b > such that, for all u, f(v, 1) > ip(9 ,(f> ) + b. 
[Since (9o,(fto) is the unique global minimum, we have 
that f(v, 1) > ^(&a, (fto), for all v. Moreover, f(v, 1) is a 
continuous function of v, and S is a compact set, hence its 
image f(S, 1) is compact. Hence f{v, 1) must be bounded 
away from ift(0o, (fto), for all v.] 

Next we claim that, for all v, and for all z > 1, 
(df/dz)(is,z) > b. [Fix any v. j(y,z) is a differ- 
entiable function of z, so by the mean value theorem, 
there exists some z £ (0,1) such that (df /dz)(v,z) = 
f(y,l) — f(v,0) > b. In addition, since tft is convex, 
{df/dz){y,z) is nondecreasing in z. This proves the 
claim.] 

Now, say we are given some {9, (ft), and assume that 
|| (8, (ft) — (9o, (fto)\\ > 1- We can write (9, (ft) in the form 

(M) = (0o,to) + ||(M)-(0o,<fo)||i>, 

for some unit vector v G S. Then we have: 
1>(0,4>) = f(y,\\{e,<j>)-(0 ,<h)\\) 

|| (0o,0o)|| 

= l) + y (df/dz){v,z)dz 

> ift(0 Q ,(f>o) + b + 6(11(0, <ft) - (0o, <MH - 1) 
= V(0o,^o)+6||(0,^)-(0o,^o)||. 

From this, we conclude that ip(9, (ft) oo as ||0, (ft\\ — * oo. 

Notice that the partition functions for p'(9) and p(9, (ft) 
are related: 

^'(0)=^(0,O). 

Hence, ift'(9) — * oo as ||0|| — > oo. 

We will use the following fact: if / : M™ — > R is con- 
tinuous, and f(x) — * oo as ||a:|| — * oo, then / is bounded 



3 



below, and / attains its minimum at some point £ R n . 
[To see this, let S = {x £ M" | f(x) < a}, choosing 
a large enough that 5^0. Note that S is bounded; 
otherwise, there would exist a sequence {xi} such that 
— > co and f(xi) < a, a contradiction. Also, note 
that S is closed; this is because the interval (—00, a] is 
closed, and / is continuous. So we have that S is compact. 
This implies that f(S) is compact. Hence f(S) is closed 
and bounded; also note that f(S) ^ 0. This implies that 
/ is bounded below, and attains its minimum.] 

From this, we conclude that ip' attains its minimum 
at some point 6** € K r . W has no boundaries, and 
ip' is differentiable everywhere on K r , so it follows that 
V^'(^*) = 0. This completes the proof. □ 



2.3 Proof of theorem [T] 

Proof: We will obtain theorem as a special case of the- 
orem |21 The basic idea is that specifying the local den- 
sity matrices p\ , . . . , p m is equivalent to specifying the 
expectation values of all Pauli matrices on the subsets 

Cl 7 ■ ■ ■ 1 Cm ■ 

Let X, Y and Z denote the Pauli matrices for a single 
qubit, and define V = {/, X, Y, Z}. We can construct n- 
qubit Pauli matrices by taking tensor products P = Pi ® 
• • • ® P n € V® n . Any 2 Tl -dimensional Hermitian matrix 
can be written as a real linear combination of n-qubit 
Pauli matrices. Furthermore, the n-qubit Pauli matrices 
are orthogonal with respect to the Hilbert-Schmidt inner 
product: tr(PtQ) = 2" if P = Q, and otherwise. 

We make the following claim: Let a be a density matrix 
on n qubits, and let p be a density matrix on a subset of 
the qubits C C {1, . . . ,n}, with \C\ = k. We claim that 
tr{i,...,n}-c( cr ) = p, if and only if, for all Pauli matrices 
P on the subset C, tr((P® I) a) = tr(Pp). (Notation: we 
write n-qubit Pauli matrices in the form P ® Q, where 
P acts on the subset C, and Q acts on the rest of the 
qubits.) 

The (=>■) direction is obvious, but we need to show 
(<=)■ We write a and p as linear combinations of Pauli 
matrices, with real coefficients P(p^q) and cup: 

0= P(P®Q) P ®Q 
p= Y a P R 

P(Zj>^)k 

We know that, for all Pauli matrices P on the subset C, 
tr((P <g> i» = 2 n p {P ® I) = tr(Pp) = 2 fc a P . But this 



implies: 

tr { i,..., n} _ c (<7) = Y, 2 n ~ k P(P®i)P 

which proves the claim. 

Thus, theorem ^ is a special case of theorem where 
the observables T\ , . . . , T r consist of all the Pauli matrices 
on the subsets Ci, . . . , C m . □ 

Acknowledgements: I am grateful to Dorit Aharonov, 
Chris Fuchs and David Meyer for helpful discussions 
about this work. Funded by an ARO/NSA Quantum 
Computing Graduate Research Fellowship. 

References 

[1] S. Sachdcv, Quantum Phase Transitions, Cambridge 
University Press, 2000. 

[2] D. Aharono v and T. Nav eh, "Quantum NP - A Sur- 
vey," Arxiv: |quant-ph/0210077| 

[3] E.T. Jaynes, "Information Theory and Statistical 
Mechanics II," Phys. Rev. (2) 108 (1957), pp.171- 
190. 

[4] E.T. Jaynes, "Information Theory and Statistical 
Mechanics," lectures at Brandeis in 1962. Reprinted 
in E. T. Jaynes: Papers on Probability, Statistics and 
Statistical Physics, R.D. Rosenkrantz (ed.), D. Rei- 
del Publishing Company, 1983. 

[5] L.D. Brown, Fundamentals of Statistical Exponential 
Families, Institute of Mathematical Statistics, 1986. 

[6] R.S. Ingarden, H. Janyszek, A. Kossakowski and T. 
Kawaguchi, "Information geometry of quantum sta- 
tistical systems," Tensor (N.S.) 37 (1982), pp.105- 
111. 

[7] R. Bhatia, Matrix Analysis, Graduate Texts in 
Mathematics, Springer- Verlag, 1997. 

[8] R.M. Wilcox, "Exponential Operators and Parame- 
ter Differentiation in Quantum Physics," J. Mathe- 
matical Physics 8 (1967), pp.962-982. 



4 




Figure 1: A single-qubit example. We want to find a 
Gibbs state p' that satisfies (a z ) = —0.6; we have one 
observable T = <j z + 0.6. We know there exists some 
state p y that satisfies (a z ) = —0.6; in this case, p 
also satisfies (a x ) = —0.3, and we let U = a x + 0.3 play 
the role of the "extra" observables. As the graph shows, 
'Vip(6,cp) vanishes at exactly one point; ip'(6) = ?/>(#, 0); 
and V"4>'{0) vanishes for some 6. 
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